This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 

• BLACK BORDERS 

• TEXT CUT OFF AT TOP, BOTTOM OR SIDES 

• FADED TEXT 

• ILLEGIBLE TEXT 

. • SKEWED/SLANTED IMAGES 

• COLORED PHOTOS 

• BLACK OR VERY BLACK AND WHITE DARK PHOTOS 

• GRAY SCALE DOCUMENTS 

IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



f ' 



WORLD INTELLECTUAL PROPERTY ORGANIZATION 
International Bureau 




INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 5 • 

C12N 7/00, 7/01, 15/00, 15/48, 15/85 


Al 


(11) International Publication Number: WO 94/29437 
(43) International Publication Date: 22 December 1994 (22.12.94) 


(21) International Application Number: PCT/US94/06415 

(22) International Filing Dates 7 June 1994 (07.06.94) 

(30) Priority Data: 

08/073,345 7 June 1993 (07.06.93) US 

(71) Applicant: UNIVERSITY OF MEDICINE & DENTISTRY OF 

KTTTTtr Tpnopv rrTO/TTCI- 1 tf\ Administration Pjfirrmlex ^0 

Bergen Street, University Heights, Newark, NJ 07101-3000 
(US). 

(72) Inventor: DORNBURG, Ralph, C; 15 Terry Avenue, South 

River, NJ 08882 (US). 

(74) Agent: MUCCINO, Richard, R.; P.O. Box 1267, Princeton, NJ 
08542 (US). 

% 


(81) Designated States: CA, JP, European patent (AT, BE, CH, DE, 
DK, ES, FR, GB, GR, IE, IT, LU, MC, NL, PT, SE). 

Published 

With international search report. 



(54) Title: HIGHLY-EFFICIENT, SELF-INACITVATING, RECOMBINATION-FREE, U3-FREE RETROVIRAL VECTORS 



(57) Abstract 

The present invention pertains to recombinant retrovirus vectors having a U3-free 5' LTR, a partially deleted 3* LTR, all essential 
cis-acting sequences for replication, an internal promoter recognizable by a selected host cell, and a non-retroviral gene under the control 
of the recognized promoter wherein (a) the U3-free 5' LTR is positioned 5' of the aon-retroviral gene and has a transcriptional promoter 
and enhancer different from that of the original retroviral promoter and enhancer replacing the original U3 region of the 5 'LTR; (b) the 
partially deleted 3' LTR is positioned 3' of the non-retro viral gene and has no U3 sequences except for those required at the attachment 
site for viral integration; (c) an exogenous polyadenylatton addidon signal sequence recognized by the selected host cell and positioned on 
the vector 3' to the 3 'LTR viral integration site; and (d) the internal recognized promoter is positioned adjacent to the non-retroviral gene 
on the vector to permit expression of the non-retroviral gene in the host cell; whereby the vector can produce progeny virus in a helper cell 
with the progeny virus being capable of infecting the selected host cell and forming a provinis in the host cell, with the non-retroviral gene 
being expressible in the host cell, but the provinis in the host cell will be replication incompetent even in the presence of a helper virus. 



tie. 



FOR THE PURPOSES OF INFORMATION ONLY 

Codes used to identify States party to the PCT on the front pages of pamphlets publishing international 
applications under the PCT. 



AT 


AustriA 


AU 


AustnlLt 


BB 


Barbados 


BE 


Belgium 


BF 


Burkina Faso 


BG 


Bulgsrii 


BJ 


Benin 


BR 


Brazil 


BY 


Belarus 


CA 


Canada 


CF 


Central African Republic 


CG 


Coogo 


CH 


Switzerland 


ci 


Cote d'lvolre 


CM 


Cameroon 


CN 




cs 


Czechoslovakia 


cz 


Czccb Republic 


DE 


Germany 


DK 


Denmark 


ES 


Spain 


Fl 


Finland 


FR 


France 


GA 


Gabon 



GB 


Uoited Kingdom 


GE 


Georgia 


GN 


Guinea 


GR 


Greece 


HU 


Hungary 


IE 


Ireland 


IT 


Italy 


JP 


Japan 


KE 


Kenya 


KG 


Kyrgystan 


KP 


Democratic People's Republic 




of Korea 


KR 


Republic of Korea 


KZ 


Kazakhstan 


U 


Liechtenstein 


LK 


Sri Lanka 


LU 


Luxembourg 


LV 


Latvia 


MC 


Monaco 


MD 


Republic of Moldova 


MG 


Madagascar 


ML 


Mali 


MN 


Mongolia 



MR 


Mauritania 


MW 


Malawi 


NE 


Niger 


NL 


Netherlands 


NO 


Norway 


NZ 


New Zealand 


PL 


Poland 


PT 


Portugal 


RO 


Romania 


RU 


Russian Federation 


SD 


Sudan 


SE 


Sweden 


SI 


Slovenia 


SK 


Slovakia 


SN 


Senegal 


TD 


Chad 


TG 


Togo 


TJ 


Tajikistan 


TT 


Trinidad and Tobago 


UA 


Ukraine 


US 


United States of America 


CZ 


Uzbekistan 


VN 


Viet Nam 



WO 94/29437 



PCT/US94/06415 



HIGHLY-EFFICIENT, SELF-INACTIVATING, 
RECOMBINATION-FREE, U3-FREE RETROVIRAL VECTORS 

Background of the invention 

^ field of the Invention 

This invention relates to recombination-free, highly-efficient 
retroviral vectors that self-inactivate after one round of retroviral replication. The 
present vectors allow cell-type specific gene expression from tissue specific 
promoters and enhancers. The recombination-free retroviral vectors eliminate risks 
of downstream activation of cellular proto-oncogenes (tumorigenesis) that can occur 
with conventional vectors. This invention also relates to a retrovirus produced by 
the recombinant retrovirus vector and a host cell containing the provirus of the 
present invention. 

Description of the Background 

The disclosures referred to herein to illustrate the background of the 
invention and to provide additional detail with respect to its practice are 
incorporated herein by reference. For convenience, the disclosures are referenced 
in the following text and respectively grouped in the appended bibliography. 

Retroviral vectors are the most efficient tools to introduce genes into 
vertebrate cells. Clinical experiments have been conducted to use retrovirus vectors 
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to cure a genetic disease in humans (adenosine deaminase (ADA) deficiency). 
Besides correcting inborn errors of metabolism, gene therapy is also being tested in 
clinical trials to cure cancer and various other diseases (Science 1992, Vol. 258, pp. 
744-746). 

5 

Retroviral vectors are basically retroviral particles that contain a 
crippled viral genome in which all viral protein coding sequences have been 
replaced with the gene(s) of interest. As a result, such viruses cannot further 
replicate after one round of infection without the help of a helper virus. Retroviral 

10 vector particles are produced by helper cells (Figure 1). Such helper cells contain 
plasmid constructs which express all retroviral proteins necessary for particle 
production and replication. After the introduction (transfection) of the retroviral 
vector genome into such helper cells, the vector genome (an RNA genome) is 
encapsidated into virus particles (due the presence of specific encapsidation 

15 sequences). Virus particles are released from the helper cell carrying a genome 
containing only the gene(s) of interest (Figure 1). After infection of a fresh target 
cell, the RN>fi genome is reverse transcribed into DNA. The DNA copy is 
integrated into the host genome (Figure 2A). The integrated viral DNA is called 
the provirus. In the last decade, several retroviral vector systems, derived from 

20 chicken or murine retroviruses, have been developed for the expression of various 
genes (for reviews see Temin, 1987; Gilboa, 1990). 

Retroviral vectors have several limitations. For example, one major 
concern is the possible activation of proto-oncogenes as a result of the integration of 

25 the vector into the target cell genome. The activation of proto-oncogenes which is 
mediated by the viral LTR (Long Terminal Repeats) promoter and enhancer can 
lead to the malignant transformation (cancer) of the infected cell. Another problem 
with current retroviral vectors is regulated gene expression. Due to the presence of 
the retroviral LTR promoter and enhancer, it is impossible to design reliable vectors 

30 for tissue specific gene expression. 

To avoid these two shortcomings, retroviral vectors have been 
developed from murine leukemia virus (MLV) and spleen necrosis virus (SNV) that 
self-inactivate after one round of retroviral replication (Yu et al, 1986; Dougherty 
35 and Temin, 1987; United States patent no. 4,980,289 to Temin et al.). This has 
been achieved by deleting parts or almost all sequences of the retroviral U3 region 
of the right LTR (Figure 2B), As a result of the mechanism of the retroviral life- 
cycle, after one round of replication, a retroviral provirus is formed that does not 
contain LTR promoters. 
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However, in the case of MLV derived vectors, the efficiency of gene 
transduction was about four orders of magnitude less than that of a vector with two 
complete LTRs. In the case of SNV, efficiency of gene transduction decreased 
5 only about tenfold. However, detailed investigations revealed that the U3-region of 
the right LTR was reconstituted in the helper cell (Olson et al., 1992). This 
reconstitution of the U3-region was the result of recombination (or gene conversion) 
of the vector with retroviral sequences present in the helper cell or in the other 
LTR. 

10 

To prevent such recombination events, new SNV-derived helper cells 
were constructed that express retroviral particle proteins from promoters and 
enhancers different from those present in the retroviral genome. Such new helper 
cells were termed DSN cells. DSN cells contain two separate plasmids for the 

15 expression of gag-pol and env proteins (Dougherty and Temin, 1989). One plasmid 
contains the gag-pol gene of SNV which is expressed from the CMV promoter. 
The other plasmid contains the envelope gene unit of SNV which is expressed from 
the RSV promoter. Experiments performed with these new helper cells and SNV 
derived U3-minus vectors (e.g., pJD220SVHY, see Figure 3) revealed that the 

20 frequency of the recombination event(s) leading to a vector with two wild-type 
LTRs was markedly reduced. However, vectors containing two wild-type LTRs 
were still recovered from transfected DSN helper cells (Dougherty et al., 1989). 
This finding showed that the (unaltered) U3-region of the left LTR of the vector 
construct can also serve as a template for the recombination (or gene conversion) 

25 event resulting in the reconstitution of the U3-region of the right LTR. 

Brief Description Of The Figures 

FIGURE 1 is a diagram illustrating retroviral helper cells producing 
recombination-free, self-inactivating retroviral vectors. 

FIGURE 2 is a diagram illustrating retroviral vector replication. 

FIGURE 3 is a diagram illustrating retroviral vectors derived from 
spleen necrosis virus (SNV). 



30 



35 
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FIGURE 4 is a diagram illustrating the nucleotide sequences at the 
junction of the cytomegalovirus (CMV) with spleen necrosis virus sequences as 
present in the pPOlll and pP0115 vector series. 

5 

Summary Of The Invention 

The present invention pertains to a recombinant retrovirus vector 
10 having a U3-free 5' LTR, a partially deleted 3' LTR, all essential cis-acting 
sequences for replication, an internal promoter recognizable by a selected host cell, 
and a non-retroviral gene under the control of the recognized promoter wherein: 

(a) the U3-free 5 1 LTR is positioned 5' of the non-retroviral gene 
and has a transcriptional promoter and enhancer different from that of the original 

15 retroviral promoter and enhancer replacing the original U3 region of the 5'LTR; 

(b) the partially deleted 3* LTR is positioned 3* of the non-retroviral 
gene and has ho U3 sequences except for those required at the attachment site for 
viral integration; 

(c) an exogenous polyadenylation addition signal sequence 
20 recognized by the selected host cell and positioned on the vector 3' to the 3* LTR 

viral integration site; and 

(d) the internal recognized promoter is positioned adjacent to the 
non-retroviral gene on the vector to permit expression of the non-retroviral gene in 
the host cell; 

25 whereby the vector can produce progeny virus in a helper cell with 

the progeny virus being capable of infecting the selected host cell and forming a 
provirus in the host cell, with the non-retroviral gene being expressible in the host 
cell, but the provirus in the host cell will be replication incompetent even in the 
presence of a helper virus. 

30 

. The present invention also pertains to a retrovirus produced by the 
recombinant retrovirus vector and a host cell . containing the provirus of the present 
invention. 

35 
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Detailed Description Of The Invention 

This invention relates to recombination-free, highly-efficient 
5 retroviral vectors that self-inactivate after one round of retroviral replication. The 
vectors allow cell-type specific gene expression from tissue specific promoters and 
enhancers and eliminate risks of downstream activation of cellular proto-oncogenes 
(tumorigenesis) that can occur with conventional vectors. The present self- 
inactivating vectors eliminate the problems of recombination and give rise to high 

10 gene transfer efficiency, up to 1,000 fold higher than that of existing self- 
inactivating vectors. The vectors described herein are derived from spleen necrosis 
virus but vector modification may be made with any retroviral vector leading to 
similar results. Thus, the novel vectors have a high potential for use in human gene 
therapy and in other gene transfer applications in which the cell-type specific gene 

15 expression is required, e.g., tissue specific gene expression in transgenic animals. 

o eliminate vector recombination and improve the efficiency of 
gene transduction, two modifications were made to a self-inactivating retroviral 
vector. First, the U3 region of the left LTR was replaced by the immediate early 
20 gene promoter and enhancer of the cytomegalovirus (CMV, Figures 3 and 4). This 
promoter was shown to be the strongest among a series of promoters tested for gene 
expression in a large variety of cell-types including the SNV promoter in D17 cells. 
In order not to abolish retroviral replication, the substitution of the U3 promoter by 
the CMV promoter was done in such a way that initiation of transcription started at 
25 the beginning of R as in the wild-type virus. In particular, if RNA transcription is 
initiated upstream of the original initiation site, first strand cDNA synthesis would 
result in a cDNA product abolishing replication. It may be transferred and 
hybridize to the R-region of the second RNA molecule. However, such a cDNA 
most probably could not function as a primer, since the immediate 3 1 nucleotides of 
30 that cDNA would not find homologous sequences for hybridization. Since the 
precise initiation of transcription was difficult to predict, three constructs 
(designated as pPOlll-Rl, pPOHl-R2, and pP0111-R3) were made in which 
variable amounts of the U3 nucleotides upstream of the first nucleotide of R were 
retained (Figures 3 and 4). The function as well as frequencies of recombination of 
35 these constructs was tested as described earlier (Dornburg and Temin, 1988; Olson 
et al., 1992) and as described below. Second, the encapsidation region, as present 
in previous SNV derived vectors, was extended. Such additional sequences 
increase the efficiency of encapsidation of the vector genome into retroviral 
particles (Embretson and Temin, 1987). 
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In one embodiment of the present invention, the vector is a 
recombinant retrovirus vector having a normally replication incompetent retrovirus 
gene sequence with a foreign eukaryotic gene. The retrovirus gene sequence is 
5 prepared with a deficiency in the retrovirus promoter so that the vector can still 
produce progeny virus in a helper cell, with the progeny virus being capable of 
infecting a selected eukaryotic host cell, forming a provirus, and expressing the 
eukaryotic gene in the host cell, but the provirus will be defective in the retrovirus 
promoter sequence. Preferably, the vector is a recombinant plasmid. In another 

10 embodiment, a foreign internal promoter is positioned adjacent to the foreign 
eukaryotic gene on the vector to permit expression of the foreign eukaryotic gene in 
the eukaryotic host cell without initiating retroviral provirus gene expression. In 
one form, the reading direction of the foreign promoter is inverted relative to the 
normal reading direction of the retrovirus gene sequence, and a foreign 3' RNA 

15 processing sequence is positioned on the side of the foreign eukaryotic gene 
sequence which is opposite to the foreign promoter. In another embodiment, the 
retrovirus is abnormally replication incompetent retrovirus of the type having a 
retrovirus portion and a foreign eukaryotic portion, the retrovirus portion having a 
deficient promoter portion, such that the virus is capable of infecting a eukaryotic 

2 0 host cell, forming a provirus, and expressing a eukaryotic protein coded for by the 
foreign eukaryotic portion in the host cell, but the provirus will be defective in a 
retrovirus promoter such that retroviral provirus gene expression doesn't take place 
in the host cell. 

25 The present invention allows one to select a eukaryotic gene of 

interest, insert the gene into a vector designed in accordance with the present 
invention, transfect a helper cell with the vector, harvest virus stock from the helper 
cell, use the harvested progeny virus to infect a target cell, and have the proviruses 
which are formed in the target cells express the inserted eukaryotic gene without 

30 expressing any retroviral proteins. Since there is no retroviral promoter that is 
active on the, provirus, endogenous helper proteins cannot trigger production of a 
virus from the provirus. Since there is no retroviral promoter in the provirus, the 
provirus cannot provide a retrovirus signal that might trigger the host cell to act in 
an unintended way. The lack of retroviral promoter stops production of retroviral 

35 RNA. This system renders much more likely the acceptability of recombinant 
retrovirus as drugs for vertebrates. 

In the present invention, the enhancer and the promoter sequences of 
the retrovirus present at U3 in SNV have been deleted from the right side of the 
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DNA sequence in the plasmid vector (3'LTR). Only the sequences required at the 
attachment site for viral integration in the 3' LTR are present. The U3 5' LTR has 
a transcriptional promoter and enhancer different from that of the original retroviral 
promoter and enhancer to replace the original U3 region of the 5' LTR. The total 
5 lack of homology in the two U3 sequences prevent recombination. The U3 
sequence normally present in the 5 1 LTR has been reported to be used as a template 
to repair a partially deleleted U3 sequence in the 3 ' LTR. By eliminating the U3 
sequence in the 5* LTR, the template for repair of the partially deleleted U3 
sequence in the 3' LTR is also eliminated. The vector DNA is used to transfect 

10 helper cells in a conventional manner. Because transcription from the vector begins 
at R on the left side, and because the rombter on the left side is not defective, virus 
can be harvested from the transfected helper cell in the conventional manner. 
Target cells can then be infected with the harvested virus. Since the right side U3 
supplies the coding sequences for both U3 segments in the resulting provirus, the 

15 transcriptional promoter which was originally deleted on one side of the plasmid 
DNA shows up as being deleted from both sides in the resulting provirus. The 
vector therefore permits a stock of the retrovirus progeny virus to be grown up, yet 
will not permit further replication after one infection cycle. 

20 If an internal promoter is not inserted, there will be no promoter to 

produce the desired foreign eukaryotic gene expression. Preferably such a 
promoter is positioned immediately adjacent to the foreign eukaryotic of interest so 
that no intervening retrovirus genes RNA is expressed. Because deletion of most of 
the U3 in SNV resulted in a loss of correct 3 ' end processing of viral RNA, even 

25 though AAUAAA was still present, a polyadenylation site was added to the vector. 
Moreover, problem recombinations are unlikely because the U3 sequences in the 
vector are not homologous. 

Abbreviations used in the present invention are as follows: pro- 
30 promoter; enh-enhancer; PBS-primer binding site for DNA synthesis; PPT- 
polypurine track for DNA synthesis; E-encapsidation sequences for RNA 
packaging; attR-f -a sequence that will form the right side of the attachment site 
relating to integration; attL-t— the sequence that will form the left side of the 
attachment side relating to integration; attL+-the deletion of the original provirus 
35 left-side attachment site; and attR+-the deletion of the original right side 
attachment site. 

The term "oligonucleotide" as used herein refers to primers, probes, 
oligomer fragments to be detected, oligomer controls, and unlabeled blocking 
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oligomers. Oligonucleotide are molecules comprised of two or more 
deoxyribonucleotides or ribonucleotides. The term "primer" as used herein refers 
to an oligonucleotide, preferably an oligodeoxyribonucleotide, either naturally 
occurring such as a purified restriction digest or synthetically produced, which is 
5 capable of acting as a point of initiation of synthesis when subjected to conditions in 
which synthesis of a primer extension product, which is complementary to a nucleic 
acid strand, is induced, i.e., in the presence of nucleotides, an agent for 
polymerization such as a DNA polymerase, and a suitable temperature and pH. 
The primer must be sufficiently long to prime the synthesis of extension products in 
10 the presence of the polymerization agent. 

FIGURE 1 is a diagram illustrating retroviral helper cells producing 
recombination-free, self-inactivating retroviral vectors. FIGURE 1A shows cells 
(preferentially D17 cells = canine cells) transfected with two different plasmids 
15 expressing the retroviral gag-pol and env genes, respectively. These genes are 
expressed from promoters and enhancers different from the vector long terminal 
repeat LTR plomoter and enhancer. FIGURE IB shows that after the transfection 
of the retroviral vector genome into such helper cells, viral particles are produced 
that contain the RNA vector genome with the gene of interest. 

20 

FIGURE 2 is a diagram illustrating retroviral vector replication. 
FIGURE 2A shows an example of a conventional retroviral vector provirus as 
present in the helper cell shown at the top. This vector contains wild-type long 
terminal repeats with full-length U3, R, and U5 regions. RNA transcription yields 

25 the RNA transcript shown below. This RNA contains an encapsidation sequence 
(E) and all other cis-acting sequences necessary for replication (pbs = primer 
binding site, ppt = polypurine tract). After infection of a fresh target cell the 
retroviral genome is reverse transcribed into a double-stranded DNA copy. During 
this process, the U3-region which contains the retroviral promoter and enhancer and 

30 which is present at the 3* end of the RNA genome is duplicated and attached to the 
5' end of the DNA copy. In a similar way, the U5 region, present only at the 5 1 
end of the RNA genome, is also duplicated and attached to the 3 f end of the DNA 
copy. The double-stranded DNA copy is integrated into the genome of the infected 
cell. Due to the presence of two long terminal repeats which contain the retroviral 

35 enhancer and promoter, tissue specific gene expression cannot be obtained. 
Further, the downstream enhancer and promoter can activate gene expression of 
genes located downstream of the integrated provirus (indicated by an arrow). 
FIGURE 2B illustrates the principle of a self-inactivating retroviral vector. The 
vector present in the helper cell is shown at the top. This vector is almost identical 
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to the one shown above. However, the long terminal repeat at the right side has 
been modified in the following way: most of U3-sequences including the promoter 
and enhancer sequences have been deleted. Maintenance of some U3-sequences 
that form the attachment site (att) is essential for the efficient integration of the 
5 vector genome. The RNA transcribed from the U3-minus vector virus does not 
contain the retroviral promoter and enhancer. Thus, after one round of retroviral 
replication, a promoter-less retroviral provirus is formed. Genes can be expressed 
from internal tissue-specific promoters (pro). 

10 FIGURE 3 is a diagram illustrating retroviral vectors derived from 

spleen necrosis virus (SNV). The SNV genome (provirus) is shown at the top. An 
example of a standard retroviral derived from SNV is shown below (pJD214HY). 
In all SNV-derived standard vectors, the protein coding regions (gag-pol and env, 
the region from Sail to the 3' end of env) have been replaced with the gene(s) of 

15 interest (in pJD214HY with hygromycin resistance gene, referred to as hygro). 
pJD220SVHY is a first generation self-inactivating retroviral vector in which the 
retroviral promoter and enhancer of the right LTR have been deleted (designated as 
U3 minus; for more details, see Figure 2B). To increase efficiency of replication, 
the polyadenylation sequence of simian virus 40 (SV40ter) has been inserted 

20 downstream of the U3-minus LTR. To avoid problems of recombination of first 
generation U3-minus vectors in the pPOl 1 1 vectors, the U3 region of the left LTR 
has been almost completely deleted (except a few nucleotides to guarantee correct 
initiation of transcription) and substituted for the immediate early promoter and 
enhancer of the human cytomegalovirus (CMV, for more details, see also Figure 

25 4). Further, this second generation, U3-free vector contains additional retroviral 
sequences (from Sail to Eagl) to increase the efficiency of encapsidation. A 
universal, second generation U3-free vector (pPOH5-Rl) contains a multiple 
cloning site replacing the promoter (SV40pro) and hygromycin B resistance gene 
(hygro) as present in pJD220SVHY and pPOlll-Rl (from Eagl to Clal). The 

30 multiple cloning site has been derived from pBluescript II (Eagl to Clal) and 
allows the easy insertion of various genes and promoters to give new retroviral 
vectors. 

FIGURE 4 is a diagram illustrating the nucleotide sequences at the 
35 junction of the cytomegalovirus (CMV) with spleen necrosis virus sequences as 
present in the pPOlll and pPOH5 vector series. An except of the SNV genome 
showing the TATA box and initiation, of transcription is shown at the top. The 
Sacl site downstream of the TATA box of tl>e CMV immediate early promoter 
(shaded sequence) was used for cloning to connect the CMV promoter and enhancer 
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to the SNV sequences. pPOHl-R2 and pP0111-R3 are 3 or 9 nucleotides shorter 
than pPOlll-Rl, respectively. 

The function of the new vectors was tested in a tissue culture system 
5 as follows and as described earlier (Olson et al M 1992): The new vector constructs 
pPOlll-Rl, pP0111-R2, and pP0111-R3 were transfected into helper cells. In 
parallel, the retroviral vectors pJD214HY, a standard retroviral vector with two 
complete wild-type LTRs, and pJD220SVHY, the first generation self-inactivating 
vector from which the new vectors have been derived (Figure 3), were also 
10 transfected into helper cell-lines. Transfected cells were selected for hygromycin 
resistance and cell-lines were established. These resulting helper cell lines are 
designated as step 1 cells. Virus was harvested from confluent step 1 cultures and 
fresh helper cells were infected. The infected helper cells were termed step 2 cells. 
Virus titers were determined (Table 1). We show that the new vectors pPOlll-Rl 
15 to pPOHl-R3 were more efficient than pJD220SVHY. They were almost as 
efficient as a vector with two wild- type LTRs (pJD214HY). 
\ 

To test for recombination, virus particles were harvested from 
confluent step 2 cell cultures (mass infection) and fresh D17 cells were infected. 

20 Detection of hygromycin resistant colonies indicates that recombination 
reconstituted the U3 region: due to the lack of control sequences in RNA 
transcripts derived from completely U3- minus proviruses, such vectors are not 
further passaged by retroviral proteins. Thus, only vectors with complete, repaired 
LTRs are transferred to new target cells. We found that pJD220SVHY was further 

25 passaged with high efficiency confirming earlier findings (Olson et al., 1992). The 
new vectors pPOlll-Rl to R3, however, were not further transferred from step 2 
helper cells (Table 1). These data show that the new vectors are resistant to 
recombination events that reconstitute the LTR region. 
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TABLE 1 

EFFICIENCY OF INFECTIVTTY AND RECOMBINATION OF SELF- 
INACTIVATING RETROVIRAL VECTORS DERIVED FROM SNV 

5 

VIRUS TITER (CFU) 



VECTOR NAME 1 . INFECTION 2. INFECTION 



pJD214HY 


6 X 105 


5 X 106 


pJD220SVHY 


8 x 104 


1 X 103 


pPOlll-Rl 


4 X 105 


0* 


pP0111-R2 


2.5 X 105 


0* 


pPOlll-Rl 


1.5 X 105 


0* 



\ 

* No hygromycin resistant colonies were obtained with pPOlll vectors. 



15 

The retroviral vectors (see also Figure 3) were transfected into helper 
cells followed by hygromycin selection. Virus was harvested from confluent 
cultures and fresh helper cells were infected (referred to as first infection). Infected 
cells were selected for hygromycin resistance, and virus titers were determined 

20 (expressed as colony forming units (CFU) per ml of supernatant medium). Cell- 
lines were established from infected tissue culture plates which contained more than 
1,000 independent hygromycin resistant colonies. Virus was harvested from 
confluent cell cultures and fresh D17 cells were infected (referred to as second 
infection). Infected cells were selected for hygromycin resistance and virus titers 

2 5 were determined . 

Examples 
Materials and Methods 

30 

Plasmid construction. 

The immediate early promoter and enhancer of cytomegalovirus, 
CMV (a 900 bp Hind3 fragment obtained from plasmid pJDCMV19SV, Dougherty 
35 et al., 1990) was cloned into plasmid pRD15 (described in Sheay et al., submitted 
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for publication). The resulting plasmid was termed pPOlOl. A DNA fragment 
containing most of the LTR, the encapsidation region, and parts of the gag-pol 
region of spleen necrosis virus (a Ball fragment derived from plasmid pSW253, 
described by Watanabe and Temin, 1982) was cloned into the blunted Sacl site of 
5 pBluescript II KS, The resulting plasmid was termed pPO102. Plasmid pPO102 
was digested with Sacl and re-ligated after treatment with Klenow polymerase 1 
(creation of blunt ends) to eliminate a Sacl site present in the encapsidation 
sequence. The resulting plasmid was termed (pPO102-S). Using PCR technology 
and plasmid pPO102-S as template, three DNA fragments were generated 

10 comprising slightly different amounts of the SNV R, U5, and encapsidation region: 
a DNA fragment termed Rl comprises map units 383 to 986 of the SNV genome, 
fragment R2 comprises map units 385 to 986, and fragment R3 comprises map units 
392 to 986. All PCR primers contained Sacl recognition sites at their 5' ends. 
After Sacl digestion, fragments Rl, R2, and R3 were cloned into the Sacl site of 

15 pBluescript II KS. The resulting plasmids were termed pPO103-Rl, pPO103-R2 
and pPO103-R3, respectively. Fragments Rl to R3 were isolated from such 
plasmids and lloned into the Sacl site of pPOlOl. The resulting plasmids were 
termed pPO104-Rl, pPO104-R2, and pPO104-R3, respectively. 

20 Plasmid pJD220SVHY (Dougherty and Temin, 19986) was digested 

with Sspl and Sacl. After Klenow poll treatment, a linker coding for the 
recognition site of the restriction enzyme Bgl2 was ligated. This procedure 
removed the left LTR promoter, and all of the encapsidation sequences present in 
this vector. The resulting plasmid was termed pPO106. DNA fragments derived 

25 from plasmids pPO104-Rl, pPO104-R2, and pPO104-R3 comprising the CMV 
promoter, R, U5, and the encapsidation sequence (Bgl2 fragments) were isolated 
and inserted into plasmid pPO106 digested with Bgl2. The resulting plasmids were 
termed pPOlll-Rl, pP0111-R2, and pPO!ll-R3, respectively. The universal 
retrovirus vectors pPOH5-Rl and PPOl 15-R2 were made by replacing the SV40 

30 promoter and hygromycin B phosphotransferase gene of pPOlll-Rl and pPOlll- 
R2 (Eagl to Clal fragment) with the multiple cloning site (Eagl to Clal) of 
pBluescript II KS. 

35 
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DNA sequence pPOlll-Rl 5528 b.p. complete sequence; 

CAGATCT GAG CTT6C CATT GCATACGT T GTATCCAT AT CATAA T AT GTACATTTATATTGGCTCA 

TGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 
5 GTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTG 

GCTGACCGCCCAACGACCCCC6CCCATTGACGTCAATAATGACGTATGTTCCCA7AGTAACGCCA 

ATAGGGACTTTCCATT6ACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTCGCAGTACA 

TCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGC 

ATTATGCCCAGTACATGACCTTATGGGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 
10 CCTATTACCATGGTGATGCCGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACG 

GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGG 

ACTTTCCAAAATGTCG7AACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGG 

GAGGTCTATATAAGCAGAGCTCATCTCTTGCTCGGGGTCGCCGTCCTGCACATTGTTGTTGTGAC 

GTGCGGCCCAGATTCGAATCTGTAATAAAACTTTTTTTTTTCTGAATCCTCAGATTGGCAGTGAG 
1 5 AGGAGATTT TGTT CGTG GT GTT GCCTGGCCT ACT GGGT GG GCGCAG G GA T CC GGACT G AAT CCGT 

AGTACTTCGGTACAACATTTGGGGGCTCGTCCGGGATACCCTCCCCATCGGCAGAGGTGCCAACT 

GCTTCTTCGAACTTTCTTCGAACTCCGGCGCCGGTGAGTTAAGTACTTGATTTTGGTACCTCGCG 

AGGGTTTGGGAGGATCGGAGTGGTGGCGGGACGCTGCCGGGAAGCTCCACC7CCGCTCAGCAGGG 

GACGCCCTGACCT6AGCTCGAATTCAGATCTTGTGGTATCTGATTGTTGTTGAGCCGTCCCTAAG 
20 ACGGTGATACTAAGTCGTGGCTTGTGTGTTTGTTTGTTGCCTTGTGT7TGT7CGTCGTTTGTCGA 

CAGCCCCTTGCGAATTGGTGTACCCACACCGCGCGGCTTGCGAATAATACTTTGGAGAGCCTTTT 

GCCTCCAGTGTCTTCCGTCTGTACTCGTCCTCCTCTCCCTCTCCGGCGGGGAGAGCTCGCCCGGG 

GATCCTCTAGTATCGTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATAC 

ACT CCG CTATCGCTAC GJ GACTGGGT CAT GGCTGCGCCC CGACACC C G C CAACACCCG CT GACGC 

2 5 GCCCTGACGGGCTTGTC^GCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCT 

GCATGT6TCA6AGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGTGGAATGTGTGTCAG 
TTAGGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTA 
GTCAGCAACCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACC 
ATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCC 

3 0 CCATGGCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCC 

AGAAGTAGTGAGGAGGCTTTTTTGGAGGCCTAGCCTTTTGCAAAAAGCTCTAGAGTCGACCTGCA 
GCCCAAGCTGATCCCCGGGGGCAATGAGATATGAAAAA6CCTGAACTCACCGCGACGTCTGTCGA 
GAAGTTTCTGATCGAAAAGTTCGACAGCGTCTCCGACCTGAT6CAGCTCTCGGAGGGCGAAGAAT 
CTCGTGCTTTCAGCTTCGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGCGCCGAT 
35 GGTTTCTACAAAGATCGTTATGTTTATCGGCAtrrTTGCATCGGCCGCGCTCCCGATTCCGGAAGT 
GCTTGACATTCGGCAATTCACCGAGAGCCTGACirrATTGCATCTCCCGCCGTGCACAGGGTGTCA 
CGTTGCAAGACCTGCCTGAAACCGAACTGCCCGCTGTTCTCCAGCCGGTCGCGGAGGCCATGGAT 
GCGATCGCTGCGGCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGG 
TCAATACACTACATGGCGT6ATTTCATATGCGCGATTGCTGATCCCCATGTGTATCACTGGCAAA 

4 0 CTGTGATG6ACGACACCGTCAGTGCGTCCGTCGCGCAGCCTCTCGATGAGCTGATGCTTTGGGCC 

GAGGACTGCCCCGAAGTCC&GCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGA 

CAATGGCCGCATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAGG 

TCGCCAACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCAGACGCGCTACTTCGAG 

CGGAGGCA7CCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGA 
4 5 CCAACTCTATCA6AGCTTGGTTGACG6CAATTTCGATGATGCAGCTTGGGCGCAGGGT CGATGCG 

ACGCAATCGTCC6ATCCGGAGCCGGGACTGTCGCGCGTACACAAATCGCCCGCAGAACCGCGGCC 

GTCTGGACCGATGGCTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGCCCCAGCACTCGTGG 

GATCAGCTTCGCACTC6AATTAATTCTCATGTTTGACAGCTTATCATCGATAAGCTC7AGG7CAA 

TGGTTTGACCAGAACATACAAGAGCAGTGGGGAATGTGGGAGGGCCTCGAGGTCGGGG7CGCCGT 
50 CC7 ACACAT7GTTGTT GT GACGT G C G G C CC AGAT TCGAATCT GTAATAAAACCT7TTT77AAC7G 

AATCCTCAGATTGGCAGTGAGAGGAGATTTTGTTCGTGGTGTTGGCTGGCCTACTGGG7GGGCGC 

AGGGATCGATCCAGACATGATAA6ATACATTGATGAGTTTGGACAAACCACAACTAGAA7GCAGT 

CAAAAAAATGCTTTATTTGTGAAATTTCTGATGCTATTGCTTTATTTGTAACCA77A7AAGCTGC 

AATAAACAAGTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTG7GGGA 
55 GGTTTTTTAAAGCAAGTAAAACCTCTACAAATCAAGCTGATCCCTGCC7CGCGCGTTTCGG7GAT 

GACGG7GAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCCGATGC 

CCGGAGCAGACAAGCCCGTCAG6GCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGA 

CCCAGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTAC 
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TGAGAG7GCACCATAT6CGG7GTGAAATACCGCACAGA7GCGTAA6GAGAAAATACCGCA7CAGG 

CGCTCTTCCGCTTCCtCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATC 

AGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGT 

GAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGG 
5 CTCCGCCCCCCTGACGAGCATCACAAAAA7CGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGG 

ACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGC 

CGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGT6GCGCT7TCTCATAGCTCACGC 

TGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGT 

7CAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACT 
10 7ATCGCCACTGGCA6CAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACA 

GAGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTA7TTGGTATCTGCGCTCT 

GC7GAAGCCAGTTACC7TCGGAAAAAGAG77GG7AGC7CT7GATCCGGCAAACAAACCACCGCTG 

G7AGCGG7GGT77T777G777GCAAGCAGCAGA7TACGCGCAGAAAAAAAGGA7CTCAAGAAGAT 

CC777GA7CT777CTACGGGG7CTGACGC7CAG7GGAACGAAAAC7CACGT7AAGGGAT777GG7 
15 CA7GAGAT7A7CAAAAAGGA7C77CACC7AGA7CC7777AAAT7AAAAATGAAG7777AAA7CAA 

7C7AAAGTA7A7ATGAG7AAACT7GG7C7GACAG77ACCAA7GC7TAATCAG7GAGGCACCTA7C 

7CAGCGATC7G7CTA777CG77CA7CCATAG77GCCTGACTCCCCGTCGTG7AGA7AAC7ACGAT 

ACGGGAGGGC77ACCATC7GGCCCCAG7GC7GCAA7GA7ACCGCGAGACCCACGC7CACCGGCTC 

CAGA7T7A7CAGCAA7AAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGG7CC7GCAACTT7A 
20 7CCGCC7CCA7CCAG7C7A77AA77G77GCCGGGAAGCTAGAG7AAGTAGT7CGCCAGTTAA7AG 

777GCGCAACGT7GT7GCCAT7GCTGCAGCCA7CG7GG7GTCACGCTCGTCG777GGTA7GGC7T 

CAT7CAGCTCCCGT7CCCAACGA7CAAGGCGAGT7ACA7GA7CCCCCATGT7G7GCAAAAAAGCG 

G77AGC7CC77CGG7CCTCCGA7CG77G7CAGAAG7AAGT7GGCCGCAGTGT7A7CACTCATGGT 

7ATGGCAGCAC7GCA7AA77C7C77AC7GTCATGCCA7CCG7AAGATGCT7T7C7G7GAC7GG7G 
25 AG7AC7CAACCAAG7C^77C7GA6AA7AG7G7ATGCGGCGACCGAGTTGCTC77GCCCGGCG7CA 

ACACGGGA7AA7ACCGCGCCACA7AGCAGAAC777AAAAGTGC7CATCATTGGAAAACG77C77C 

GCGGCGAAAAC7CTCAAGGA7C77ACCGC7G77GAGA7CCAG7TCGATGTAACCCAC7CG7GCAC 

CCAAC7GA7CTTCAGCA7C7777AC777CACCAGCGT77C7GGGTGAGCAAAAACAGGAAGGCAA 

AA7GCCGCAAAAAAGGGAATAAGGGCGACACGGAAATG7TGAATACTCATAC7C77CC77TTTCA 
30 A7C 



DNA sequence pPOHl-R2 5525 b.p. complete sequence; 

3 5 CAGA7C7GAGC77GCCA7TGCA7ACG77G7A7CCA7A7CATAATATGTACA777A7A77GGCTCA - 

T67CCAACA77ACCGCCATGT7GACA77GA77A77GAC7AG77AT7AATAG7AATCAAT7ACGGG 
G7CA7TAG77CA7AGCCCATA7A7GGAG77CCGCG77ACATAACTTACGGTAAATGGCCCGCC7G 
GC7GAC CGCCCAACGACCCC C GCCC A7 7 GA CG7CAA7 AA7 GACG7 AT G T T C C C A7AGT AACGCCA 
A7AGGGACT77CCAT7GACG7CAA7GGG7GGAG7A777ACGG7AAACTGCCCACT7GGCAG7ACA 

40 7CAAG7G7A7CA7ATGCCAAG7ACGCCCCC7AT7GACG7CAA7GACGGTAAA7GGCCCGCC7GGC 
A77A7GCCCAG7ACA7GACCT7ATGGGACT77CCTACT7GGCAGTACATCTACG7ATTAG7CA7C 
GCTA77ACCATGG7GA7GCGGT7TTGGCAG7ACATCAATGGGCGTGGATAGCGG777GACTCACG 
GGGA777CCAAGTC7CCACCCCAT7GACG7CAA76GGAGTTTG7T7TGGCACCAAAA7CAACGGG 
ACTT7CCAAAATCTCGTAACAACTCCGCCCCAT7GACGCAAA7GGGCGGTAGGCGTG7ACGGTGG 

45 GAGG7CTA7ATAAGCAGAGCTCTC77GC7CGGGGTCGCCG7CC7GCACAT7G7TGT7GTGACG7G 
CGGCCCAGA7TCGAA7CTGTAATAAAAC77TT7TT7TTCTGAA7CC7CAGAT7GGCAGTGAGAGG 
AGAT777G7TCG7GG7G7TGCC7GGCC7AC7GGG7GGGCGCAGGGATCCGGAC7GAA7CCG7AG7 
ACT7CGG7ACAACATTTGGGGGCTCGTCCGGGA7ACCC7CCCCATCGGCAGAGG7GCCAAC7GC7 
7C77CGAAC7TTCTTCGAAC7CCGGCGCCGG7GAG7TAAGTACTTGAT777GG7ACCTCGCGA6G 

50 G777GGGAGGA7CGGAGTGG7GGCGGGACGC7GCCGGGAAGC7CCACCTCCGCTCAGCAGGGGAC 
GCCC7GACCTGAGC7C6AA7TCAGATC77GTGGTA7CTGAT7GTTGT7GAGCCGTCCCTAAGACG 
G7GA7AC7AAGTCGTG6C77G7G7G777G77TGT7GCCTTGTGT77GT7CGTCGTTTGTCGACAG 
CGCCT7GCGAA7TGG7GTACCCACACCGCGCGGC77GCGAA7AATAC777GGAGAGCCTT7TGCC 
7CCAG7G7C77CCG7CTGTAC7CGTCC7CC7C7CCCTCTCCGGCCGGGAGAGC7CGCCCGGGGAT 

55 CCT C7AG7A7GG7GCAC7 CT CAG7 AC AAT C 7 GCTC7GAT GCCG CA T AG T 7 AA G CCAGT A7 ACACT 

CCGCTA7CGC7ACG7GAC7GGG7CA7GGC7GCGCCCCGACACCCGCCAACACCCGCTGACGCGCC 
CTGACGGGC77G7CTGCTCCCGGCA7CCGC77ACAGACAAGCTGTGACCG7CTCCGGGAGC7GCA 
7GTG7CAGAGG7TTTCACCG7CA7CACCGAAACGCGCGAGGCAGC7GTGGAA7G7GTGTCAGT7A 
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GGGTGTCGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTC 
AGCAACCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATA 
GTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAAC7CCGCCCAGTTCCGCCCATTCTCCGCCCCA 
TGGCTGACTAATTTTTTTTATTTATGCAGA6GCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGA 
AGTAGTGAGGAGGCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCTAGAGTCGACCTGCAGCC 
CAAGCTGATCCCCGGGGGCAATGAGATATGAAAAAGCCTGAACTCACCGCGACGTCTGTCGAGAA 
GTT TCTGAT CGAAAAGTTCGACAGCGT CT CCGACCT GAT GCAG CT CT C GG AGGGC GAA GAAT CTC 
6TGCTTTCAGCTTCGATGTAGGAGGGCGTGGATATGTCCTGCGGGTAAATAGCTGCGCCGATGGT 
TTCTACAAAGATCGTTATGTTTATCGGCACTTTGCA7CGGCCGCGCTCCCGATTCCGGAAGTCCT 
TGACATTCGGGAATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCACGT 
TGCAAGACCTGCCTGAAACCGAACTGCCCGCTGTTCTGCAGCCGGTCGCGGAGGCCATGGATGCG 
ATCGCTGC6GCCGATCTTAGCCAGACGAGCGGGTTCGGCCCATTCGGACCGCAAGGAATCGGTCA 
ATACACTACATGCCGTGATTTCATATGCGCGATTGCTGATCCCCATGTGTATCACTGGCAAACTG 
TGATGGACGACACCG7CAGTGCGTCCGTCGCGCAGGCTCTCGATGAGCTGATGCTTTGGGCCGAG 
6ACTGCCCC6AAGTCCGGCACCTCGTGCACGCGGATTTCGGCTCCAACAATGTCCTGACGGACAA 
TGGCCGCATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGCGGATTCCCAATACGAGGTCG 
CCAACATCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCAGACGCGCTACTTCGAGCGG 
AGGCATCCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGACCA 
ACTCTATCAGAGCTTGGTTGACGGCAATTTCGATGATGCAGCTTGGGCGCAGGGTCGATGCGACG 
CAATCGTCCGATCCGGAGCCGGGACTGTCGGGCGTACACAAATCGCCCGCAGAAGCGCGGCCGTC 
TGGACCGATCGCTGTGTAGAAGTACTCGCCGATAGTGGAAACCGACGCCCCAGCACTCGTG6GAT 
CAGCTTGGCACTGGAAT7AATTCTCATGTTTGACAGCTTATCATCGATAAGCTCTAGGTCAATGG 
TTTGACCAGAACATACAAGAGCAGTGGGGAATGTGGGAGGGCCTCGAGGTCGGGGTCGCCGTCCT 
ACACATTCTTGTTGTGACGTGCGGCCCAGATTCGAATCTGTAATAAAACCTTTTTTTAACTGAAT 



GATCGATCCAGACATGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAA 
AAAAATGCTTTATT7GTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAAT 
AAACAAGTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGT 
TTTTTAAAGCAAGTAAAACCTCTACAAATCAAGCTGATCCCTGCCTCGCGCGTTTCGGTGATGAC 
GGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGG 
GAGCAGACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCC 
AGTCACGTAGCGATAGCGGAGTGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGA 
GAGTGCACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCAT CAGGCGC 
TCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC 
TCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAG 
CAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTC 
CGCCCCCCT GACGAG CAT CACAAAAATCGAC GCTCAAGTCA GAGGT G G C G AAACCCGA CAG GACT 
ATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGC 
TTACCGGATACCTGTCC6CCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGT 
AGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCA 
GCCCGACC6CTGCGCCTTATCC6GTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTAT 
CGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAG 
TTCTTGAAGTG6TGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCT 
GAAGC CAGTT ACCTT CGGAAAAAGAGTTGGT AGC T CT TGAT C C GGCAAA CAAACCACC GCTGGTA 
GCGGT G GTT T TT T TGTTT G CAAGCAGCAGATT ACGCGCAGAAAAAAA G GAT C T CAAGAAGAT CCT 
TTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCAT 
GAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCT 
AAAGT ATATATGAGT AAACTT GGT CTGACAGTTACCAAT GCTTAATCAGTGAGG CACCT AT CTCA 
GCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACG 
GGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAG 
ATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCACAAGTGGTCCTGCAACTTTATCC 
GCCTCCATCCA6TCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTT 
GCGCAACGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCAT 
TCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTT 
AGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTAT 
GGCAGCACTGCATAATTCTCTTACTGTCAT6CCATCCGTAAGATGCTTTTCTGTGACTGGTGAGT 
ACTCAACCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACA 
CGGGATAATACCGCGCCACATAGCAG AACT T TAAAAGT GC T CAT CA T T G GAAAACGT T CT TCGGG 
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GCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCA 
ACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAAT 
GCCGCAAAAAAGGGAATAAGG6CGACACGGAAATGTTGAAT ACT CAT A CT CT T CCTT 7 T T CAATC 

5 

DNA sequence pP0111-R3 5519 b.p. complete sequence; 

CAGATCTGAGCTTGCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCA 
TGTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGTTATTAATAGTAATCAATTACGGG 
10 GTCATTAGTTCATAGCCCATATATGGAGTTCCGCGTTACATAACTTACGGTAAATGGCCCGCCTG 
GCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAATGACGTATGTTCCCATAGTAACGCCA 
ATAGGCACTTTCCATTGACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACA 
TCAAGTGTATCATATGCCAAGTACGCCCCCTATTGACGTCAATGACGGTAAATGGCCCGCCTGGC 
ATTATGCCCAGTACAT6ACCTTAT6GGACTTTCCTACTTGGCAGTACATCTACGTATTAGTCATC 
15 GCTATTACCATGGTGATGCGGTTTTGGCAGTACATCAATGGGCGTGGATAGCGGTTTGACTCACG 
GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTTTGTTTTGGCACCAAAATCAACGGG 
ACTTTCCAAAATGTCGTAACAACTCCGCCCCATTGACGCAAATGGGCGGTAGGCGTGTACGGTGG 
GACGTCTATATAAGCAGAGCTCTCGGGGTCGCCGTCCTGCACATTGTTGTTGTGACGTGCGGCCC 
AGATTCGAATCTGTAATAAAACTTTTTTTTTTCTGAATCCTCAGATTGGCAGTGAGAGGAGATTT 
20 TGTTCGTGGTGTTGCCTGGCCTACTGGGTGGGCGCAGGGATCCGGACTGAATCCGTAGTACTTCG 
GTACAACATTTG6GGGCTCGTCCGGGATACCCTCCCCATCGGCAGAGGTGCCAACTGCTTCTTCG 
AACTTTCTTCGAACTCCGGCGCC6GTGAGTTAAGTACTTGATTTTGGTACCTCGCGAGGGTTTGG 
GAGGATCGGAGTGGTGGCGGGACGCTGCCGGGAAGCTCCACCTCCGCTCAGCAGGGGACGCCCTG 
ACCTGAGCTCGAATTCAGATCTTGTGGTATCTGATTGTTGTTGAGCCGTCCCTAAGACGGTGATA 
25 CTAAGTCGTGGCTTGK1TGTTTGTTTGTTGCCTTGTGTTTGTTCGTCGTTTGTCGACAGCGCCTT 
GCGAATTGGTGTACCCACACCGCGCGGCTTGCGAATAATACTTTGGAGAGCCTTTTGCCTCCAGT 
GTCTTCCGTCTGTACTCGTCCTCCTCTCCCTCTCCGGCCGGGAGAGCTCGCCCGGGGATCCTCTA 
GTAT6GTGCACTCTCAGTACAATCTGCTCTGATGCCGCATAGTTAAGCCAGTATACACTCCGCTA 
TCGCTACGTGACTGGGTCATGGCTGCGCCCCGACACCCGCCAACACCCGCTGACGCGCCCTGACG 
30 GGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCCGGGAGCTGCATGTGTC 
AGAGGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGTGGAATGTGTGTCAGTTAGGGTGT 
GGAAA GTCCCCAGG CTCCCCAGCAGGCA G AA GTATGC AAAGCATGCA T CT C AA TTAGT CAGCAAC 
CAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTAGTCAGCAACCATAGTCCCG 
CCCCTAACTCCCCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCAT66CTG 
35 ACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGT 
GACGAGCCTTTTTTGGAGGCCTAGGCTTTTGCAAAAAGCTCTAGAGTCGACCTGCAGCCCAAGCT 
GATCCCCGGGGGCAATGAGATATGAAAAAGCCTGAACTCACCGCGACGTCTGTCGAGAAGTTTCT 
GATCGAAAAGTTCGACAGCGTCTCCGACCTGATGCAGCTCTCGGAGGGCGAAGAATCTCGTGCTT 
TCAGCTTCGATGTAGGAGGGCCTGGATATGTCCTGCGGGTAAATAGCTGCGCCGATGGTTTCTAC 
40 AAAGATCGTTATGTTTATCGGCACTTTGCATCGGCCGCGCTCCCGATTCCGGAAGTGCTTGACAT 
TGGGGAATTCAGCGAGAGCCTGACCTATTGCATCTCCCGCCGTGCACAGGGTGTCACGTTGCAAG 
ACCTGCCTGAAACC6AACTGCCCGCTGTTCTGCAGCCGGTCGCGGAG6CCATGGATGCGATCGCT 
G CGGCCGATCTTAG CCAGACGAGCGGGT T CGGCCCATTCGGACCG CAA GGAATCGGT CAAT ACAC 
TACATGGCGTGATTTCATATGCGCGATTGCTGATCCCCATGTGTATCACTGGCAAACTGTGATGG 
45 ACGACACCGTCAGTGCGTCCGTCCCGCAGGCTCTCGATGAGCTGATGCTTTGGGCCGAGGACTGC 
CCCGAAGTCCGGCACCTCGTCCACGCCGATTTCGGCTCCAACAATGTCCTGACGGACAATGGCCG 
CATAACAGCGGTCATTGACTGGAGCGAGGCGATGTTCGGGGATTCCCAATACGAGGTCGCCAACA 
TCTTCTTCTGGAGGCCGTGGTTGGCTTGTATGGAGCAGCAGACGCGCTACTTCGAGCGGAGGCAT 
CCGGAGCTTGCAGGATCGCCGCGGCTCCGGGCGTATATGCTCCGCATTGGTCTTGACCAACTCTA 
50 TCAGAGCTTGGTTGACGGCAATTTCGATGATGCAGCTTGGGCGCAGGGTCGATGCGACGCAATCG 
TCCGATCCGGAGCCGGGACTGTCGGGCGTACACAAATC6CCCGCAGAAGCGCGGCCGT CTGGACC 
GATGGCTGTGTAGAAGTACTCGCCGATAGTGGAAACC6ACGCCCCAGCACTCGTGGGATCAGCTT 
GGCACTGGAATTAATTCTCATGTTTGACAGCTTATCATCCATAAGCTCTAGGTCAATGGTTTGAC 
CAGAACATACAAGAGCAGTGGGGAATGTGGGAGGGCCTCGAGGTCGGGGTCGCCGTCCTACACAT 
55 TGTTGTTGT6ACGTGCGGCCCAGATTCGAATCTGTAATAAAACCTTTTTTTAACTGAATCCTCAG 
ATTGGCAGT6AGAGGAGATTTTGTTCGTGGTGTTGGCTGGCCTACTGGGTGGGCGCAGGGATCGA 
TCCAGAWTGATAAGATACATTGATGAGTTTGGACAAACCACAACTAGAATGCAGTGAAAAAAAT 
GCTTTATTTGTGAAATTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAA 
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GTTAACAACAACAATTGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTA 
AAGCAAGTAAAACCTCTACAAATCAAGCTGATCCCTGCCTCGCGCGTTTCGGTGATGACGGTGAA 
AACCTCTGACACATGCAGCTCCCGGAGACGGTCACAGCTTGTCTG7AAGCGGATGCCGGGAGCAG 
ACAAGCCCGTCAGGGCGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCAC 
5 GTAGCGATA6C6GAGTGTATACT6GCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGC 
ACCATATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCC 
GCTTCCTCCCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTC 
AAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAG 
GCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCC 

10 CCT6ACGAGCATCACAAAAATCGACGCTCAAGTCA6AGGTGCCGAAACCCGACAGGACTATAAAG 
ATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCG 
GATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGT6GCGCTTTCTCATAGCTCACGCTGTAGGTAT 
CTCAGTTC6GTGTAG6TCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGA 
CCGCTGCGCCTTA7CCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCAC 

15 TGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATCTAGGC6GTGCTACAGAGTTCTTG 
AAGT6GTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCC 
AGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTG 
GTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATC 
TTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACCTTAAGGGATTTTGGTCATGAGATT 

2 0 ATCAAAAAG6ATCTTCACCTAGATCCTTTTAAATTAAAAATCAAGTTTTAAATCAATCTAAAGTA 

TATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATC 
TGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGG 
CTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTAT 
CAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTCG7CCTGCAACTTTATCCGCCTCC 
25 ATCCAGTCTATTAATTGJTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAA 
CGTTGTTGCCATTGCTGCAGGCATCGTGGTGTCACGCTCGTCGT77GGTATGGCTTCATTCAGCT 
CCGGTTCCCAACGA7CAAGGCGA6TTACATGATCCCCCATG7TG7GCAAAAAAGCGGT7AGCTCC 
TTCGGTCC7CCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGC 
ACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAA 

3 0 CCAAG7 CATT CT GAGAATAGTGTAT G CGGCGACC GAGT7 GCTC7 7 GC C C G GCGTCAACAC GGGAT 

AATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAA 
ACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCAC7CGTGCACCCAACTGAT 
CT TC AGCATCTTTT ACTTTCACCAGCGTTT CT G G GT GAGCAAAAACA G GAAGGCAAAA 7 G C CGC A 
AAAAAGGCAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATC 

35 

DNA sequence PPOl 15-R1 3878 b.p. complete sequence; 

CAGATCTGA6CT7GCCATTGCATACGTTGTATCCATATCATAATATGTACATTTATATTGGCTCA 

40 7GTCCAACATTACCGCCATGTTGACATTGATTATTGACTAGT7AT7AATAG7AATCAATTACGGG 
GT CATTAGTT CAT AGCCCA7ATATGGAGT T C CGCG7T ACAT AACT 7ACGG7AAAT GGC CC GCCT G 
GCTGACCGCCCAACGACCCCCGCCCATTGACGTCAATAA7GACGTATGTTCCCATAGTAACGCCA 
ATAGGGACTTTCCA77GACGTCAATGGGTGGAGTATTTACGGTAAACTGCCCACTTGGCAGTACA 
T C AAGT GT ATCAT AT GCCAAGTACGCCCCC T AT T GACGTCAA7 GAC GG7 AAAT GG CCC GC CT GGC 

45 ATTATGCCCAGTACA7GACC7TATGGGACTTTCCTACT7CGCAGTACATCTACGTAT7AGTCATC 
GCTATTACCATGGTGATGCGGTT7TGGCAGTACATCAA7GGGCGTGGATAGCGGTTTGACTCACG 
GGGATTTCCAAGTCTCCACCCCATTGACGTCAATGGGAGTT7G777TGGCACCAAAATCAACGGG 
ACTTTCCAAAATCTCGTAACAACTCCGCCCCATTGACGCAAATGGGCnCJTAGGCGTGTACGGTCG 
GAGGTCTATATAAGCAGACCTCATCTCTTGCTCGGGGTCGCCGTCCTGCACATTGTTGTTGTGAC 

50 GTGCGGCCCAGATTCGAATCTGTAATAAAACTTTTTTTTTTCTGAATCCTCAGATTGGCAGTGAG 
AGGAGATTTTGTTCGTGGTGTTGCCTGGCCTACTGGGTGGGCGCAGGGATCCGGACTGAATCCGT 
AGTACTTCGG7ACAACA777GGGGGCTCGTCCGGGATACCCTCCCCATCGGCAGAGGTGCCAACT 
GCTTCTTCGAACTTTCTTCGAACTCCGGCGCCGGTGAGTTAAGTACTTGATTTTGGTACCTCGCG 
AGGGTTTGGGAGGATCGGAGTGGTGGCGGGACGCTGCCGGGAAGCTCCACCTCCGCTCAGCAGGG 

55 GACGCCCTCACCTCAGCTCTGTGGTATCTGATTCTTGTTGAGCCGTCCCTAAGACGG7GATACTA 
AGTCGTGGCTTGTGTGTTTGTTTGTTGCCTTGTGTTTGTTCGTCGTTTGTCGACAGCGCCTTGCG 
AATTGGTGTACCCACACCGCGCGGCTTGCGAATAATACTTTGGAGAGCCTTTTGCCTCCAGTGTC 
TTCCGTCTCTACTCGTCCTCCTCTCCCTCTCCGGCCGCTCTAGAACTAGTGGATCCCCCGGGCTG 
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CAGGAATTCGATATCAAGC7TATCGATAAGCTCTAGGTCAATGGTTTGACCAGAACATACAAGAG 
CAGTGGGGAATGTGGGAGGGCCTCGAGGTCGGGGTCGCCGTCCTACACATTGTTGTTGTGACGTG 
CGGCCCAGATTCGAATCTG7AATAAAACCTTTTTTTAACTGAATCCTCAGATTGGCAGTGAGAGG 
AGATTTTGTTCGTGGTGTTGGCTGGCCTACTGGGTGGGCGCAGGGATCGATCCAGACATGATAAG 
5 ATACATTGATGACTTTGGACAAACCACAACTAGAATGCAGTCAAAAAAATGCTTTATTTGTGAAA 
TTTGTGATGCTATTGCTTTATTTGTAACCATTATAAGCTGCAATAAACAAGTTAACAACAACAAT 
TGCATTCATTTTATGTTTCAGGTTCAGGGGGAGGTGTGGGAGGTTTTTTAAAGCAAGTAAAACCT 
CTACAAATCAAGCTGA7CCCTGCCTCGCGCGTTTCGGTGAT6ACGGTGAAAACCTCTGACACATG 
CAGCTCCCGGAGACGGTCACAGCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGG 

10 CGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCGCAGCCATGACCCAGTCACGTAGCGATAGCGGAG 
TGTATACTGGCTTAACTATGCGGCATCAGAGCAGATTGTACTGAGAGTGCACCATATGCGGTGTG 
AAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGCTCTTCCGCTTCCTCGCTCACT 
GACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACG 
GTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCA 

15 GGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCAC 
AAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCC 
CCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCT 
TTCTCCCTTCGGCAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAG 
CTCGTTCGCTCCAAGCTCGGCTGTCTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATC 

20 CGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTG 
GTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAAC 
TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAA 
AAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCA 
AGCAG CAGATTACGCGCAGAAAAAAAGGAT CT CAA GAAGAT CCT TTGATCTTTTCTACGGGGTCT 

25 GAC GCT CAGTGGAAC^AAACTCACGTT AA G GG ATT T T GGT CAT GA GATTAT CAAAA A G GAT CT T 

CACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTT 
GGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCA 
TCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCC 
CAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGC 

30 CAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAAT 
TGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGT7TGCGCAACGTTGTTGCCATTGC 
TGCAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAAC6AT 
CAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATC 
GTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCT 

35 TACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGACTACTCAACCAAGTCATTCTCAG 
AATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGATAATACCGCGCCACAT 
AGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTT 
ACCGCT GT T GAGATCCAGTTCGAT GTAAC CCACT CGT GCA CCCAACTGA T C TT CAGCA T CTT TTA 
CTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGG 

4 0 GCGACACGCAAATGTTGAATACTCATACTCTTCCTTTTTCAAT 
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I claim: 

L A recombinant retrovirus vector having a U3-free 5* LTR, a 
partially deleted 3 f LTR, all essential cis-acting sequences for replication, an 
5 internal promoter recognizable by a selected host cell, and a non-retroviral gene 
under the control of the recognized promoter wherein: 

(a) the U3-free 5* LTR is positioned 5' of the non-retroviral gene 
and has a transcriptional promoter and enhancer different from that of the original 
retroviral promoter and enhancer replacing the original U3 region of the 5'LTR; 
10 (b) the partially deleted 3* LTR is positioned 3* of the non-retroviral 

gene and has no U3 sequences except for those required at the attachment site for 
viral integration; 

(c) an exogenous poly adenylate on addition signal sequence 
recognized by the selected host cell and positioned on the vector 3 1 to the 3* LTR 
15 viral integration site; and 

Yd) the internal recognized promoter is positioned adjacent to the 
non-retroviral gene on the vector to permit expression of the non-retroviral gene in 
the host cell; 

whereby the vector can produce progeny virus in a helper cell with 
20 the progeny virus being capable of infecting the selected host cell and forming a 
provirus in the host cell, with the non-retroviral gene being expressible in the host 
cell, but the provirus in the host cell will be replication incompetent even in the 
presence of a helper virus. 

25 2. A retrovirus produced by the recombinant retrovirus vector of 

claim 1. 



3. A host cell containing the provirus of claim 1. 
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FIGURE 1 
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